Robot Weightlifting By Direct Policy Search

نویسندگان

  • Michael T. Rosenstein
  • Andrew G. Barto
چکیده

This paper describes a method for structuring a robot motor learning task. By designing a suitably parameterized policy, we show that a simple search algorithm, along with biologically motivated constraints, offers an effective means for motor skill acquisition. The framework makes use of the robot counterparts to several elements found in human motor learning: imitation, equilibrium-point control, motor programs, and synergies. We demonstrate that through learning, coordinated behavior emerges from initial, crude knowledge about a difficult robot weightlifting task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Advancing student interaction with a learning robot: Digital twin, connectivity, and augmented reality

During interaction with learning robots, students often experience difficulties understanding the robot intent and its practical realization. To address this challenge, we propose a connected environment that integrates the robot, its digital twin and virtual sensors. The environment supports digital experiments that enable the physical robot to determine optimal policy for performing a manipul...

متن کامل

Reward-Weighted Regression with Sample Reuse for Direct Policy Search in Reinforcement Learning

Direct policy search is a promising reinforcement learning framework, in particular for controlling continuous, high-dimensional systems. Policy search often requires a large number of samples for obtaining a stable policy update estimator, and this is prohibitive when the sampling cost is expensive. In this letter, we extend an expectation-maximization-based policy search method so that previo...

متن کامل

Efficient Sample Reuse in EM-Based Policy Search

Direct policy search is a promising reinforcement learning framework in particular for controlling in continuous, high-dimensional systems such as anthropomorphic robots. Policy search often requires a large number of samples for obtaining a stable policy update estimator due to its high flexibility. However, this is prohibitive when the sampling cost is expensive. In this paper, we extend an E...

متن کامل

Weightlifting Motion Planning for a Puma 762 Robot

In this paper we develop a point-to-point weightlifting motion planner for open-chained rvbota. The joint trajectm”es am defined b~ B-spline pol~omials along with a time-scale factor. Ph@al limitations of a Puma 762 robot a~ incorpomted into the formulation. The torque limits are formulated aa a Penaltg function (soft constminta) added into the objeetive function while the position and velocity...

متن کامل

Evolutionary Policy Transfer and Search Methods for Boosting Behavior Quality: RoboCup Keep-Away Case Study

This study evaluates various evolutionary search methods to direct neural controller evolution in company with policy (behavior) transfer across increasingly complex collective robotic (RoboCup keep-away) tasks. Robot behaviors are first evolved in a source task and then transferred for further evolution to more complex target tasks. Evolutionary search methods tested include objective-based se...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001